Learning Rules from Multiple Instance Data: Issues and Algorithms
Abstract
In a multiple-instance representation, each learning example is represented by a “bag” of fixed-length “feature vectors”. Such a representation, lying somewhere between propositional and first-order representation, offers a tradeoff between the two. This paper proposes a generic extension to propositional rule learners to handle multiple-instance data. It describes NAIVE-RIPPERMI, an implementation of this extension on the rule learning algorithm RIPPER. It then explains several pitfalls encountered by this naive extension during induction. It goes on to describe algorithmic modifications and a new multiple-instance coverage measure which are shown to avoid these pitfalls. Experimental results show the benefits of this approach for solving propositionalized relational problems in terms of speed and accuracy.
Keywords: multiple-instance learning problem, rule learning, propositionalization, relational learning, mutagenesis learning task
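To make the bag representation concrete, here is a minimal Python sketch. It assumes the standard existential multiple-instance convention (a rule covers a bag if at least one instance in the bag satisfies it). This is not the new coverage measure the paper proposes, which the abstract does not specify. The names `covers_bag`, `bag_coverage`, and `example_rule`, and the toy data, are hypothetical and chosen only for illustration.

```python
from typing import Callable, List, Sequence, Tuple

Instance = Sequence[float]         # one fixed-length feature vector
Bag = List[Instance]               # one learning example is a bag of instances
Rule = Callable[[Instance], bool]  # a propositional rule tested on one instance


def covers_bag(rule: Rule, bag: Bag) -> bool:
    """Assumed convention: a bag is covered if any of its instances satisfies the rule."""
    return any(rule(x) for x in bag)


def bag_coverage(rule: Rule, bags: List[Bag], labels: List[bool]) -> Tuple[int, int]:
    """Return (covered positive bags, covered negative bags)."""
    pos = sum(1 for b, y in zip(bags, labels) if y and covers_bag(rule, b))
    neg = sum(1 for b, y in zip(bags, labels) if not y and covers_bag(rule, b))
    return pos, neg


def example_rule(x: Instance) -> bool:
    """Hypothetical rule: a conjunction of two attribute tests."""
    return x[0] > 0.5 and x[1] <= 2.0


# Toy data (invented for illustration): three bags of two instances each.
bags = [
    [(0.9, 1.0), (0.1, 5.0)],  # positive bag: the first instance satisfies the rule
    [(0.2, 1.0), (0.3, 0.5)],  # negative bag: no instance satisfies the rule
    [(0.7, 3.0), (0.6, 2.0)],  # positive bag: the second instance satisfies the rule
]
labels = [True, False, True]

print(bag_coverage(example_rule, bags, labels))
```

Counting coverage at the bag level rather than the instance level is what distinguishes this setting from ordinary propositional rule learning, where each feature vector would carry its own label.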
Similar resources
Noise-Tolerant Rule induction from Multi-Instance data
This paper addresses the issue of multiple-instance induction of rules in the presence of noise. It first proposes a multiple-instance extension of rule-based learning algorithms. Then, it shows what kind of noise can appear in multiple-instance data, and how to handle it theoretically. Finally, it describes the implementation of such a noise-tolerant multiple-instance learner, and shows its pe...
INTEGRATED ADAPTIVE FUZZY CLUSTERING (IAFC) NEURAL NETWORKS USING FUZZY LEARNING RULES
The proposed IAFC neural networks have both stability and plasticity because they use a control structure similar to that of the ART-1 (Adaptive Resonance Theory) neural network. The unsupervised IAFC neural network is the unsupervised neural network which uses the fuzzy leaky learning rule. This fuzzy leaky learning rule controls the updating amounts by fuzzy membership values. The supervised IAFC ...
Solving multiple-instance and multiple-part learning problems with decision trees and decision rules. Application to the mutagenesis problem
In recent work, Dietterich et al. (1997) presented the problem of supervised multiple-instance learning and showed how to solve it by building axis-parallel rectangles. This problem is encountered in contexts where an object may have different possible alternative configurations, each of which is described by a vector. This paper introduces the multiple-part problem, which is more general than the...
IFSB-ReliefF: A New Instance and Feature Selection Algorithm Based on ReliefF
The growing use of the Internet and phenomena such as sensor networks has led to an unnecessary increase in the volume of information. Although this has many benefits, it causes problems such as the need for more storage space and better processors, as well as for data refinement to remove unnecessary data. Data reduction methods provide ways to select useful data from a large amount of duplicate, incomp...
Semi-Supervised Composite Kernel Learning Using Distance Metric Learning Techniques
The distance metric plays a key role in many machine learning and computer vision algorithms, so choosing an appropriate distance metric has a direct effect on the performance of such algorithms. Recently, distance metric learning using labeled data or other available supervisory information has become a very active research area in machine learning applications. Studies in this area have shown t...